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ABSTRACT 

X-tree  is  tree-structured  network  of  single- 
chip  processors  designed  for  1985  technology.  An 
operatins  system  kernel  for  supervising  the  opera¬ 
tion  of  each  processor  was  designed  and  coded*  To 
test  the  coder  an  X-tree  emulator  w3s  written  to 
run  on  the  RDP  11/70  UNIX  system.  The  kernel 
internal  structure  3nd  the  emulation  techniques 
used  sre  presented  in  this  paper.  The  feasibility 
of  the  kernel  design  is  demonstrated  by  a  success¬ 
ful  emulation  of  the  X-tree  executing  3  simple 
user  program.  Suggestions  are  also  made  for 
future  extensions  of  this  work  to  add  more  func¬ 
tions  to  the  kernel  arid  tc  improve  the  range  and 
efficiency  of  the  emulation. 
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X-tree  is  tree-structured  network  of  single¬ 
chip  processors  designed  for  1985  technology*  An 
operating  system  kernel  for  supervising  the  opera¬ 
tion  of  each  processor  was  desisned  and  coded.  To- 
test  the  'code >  an  X~tree  emulator  was  written  to 
run  on  the  PDF'  11/70  UNIX  system.  The  kernel 
internal  structure  and  the  emulation  techniques 
used  are  presented  in  this  paper.  The  feasibility 
of  the  kernel  design  is  demonstrated  by  a  success¬ 
ful  emulation  of  the  X-tres  executing  a  simple 
user  prosram .  Suggestions  are  also  made  for 
future  extensions  of  this  work  to  add  more  func¬ 
tions  Ko  the  kernel  and  to  improve  the  range  and 
efficiency  of  the  emulation. 
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If  the  current  trends  in  MLSI  architecture  continue?  it 
will  be  possible  to  put  a  complete  computer?  including 
memory  end  communications  hardware?  on  a  single  chip  by  the 
mid-1980s  C6J,  To  explore  various  issues  in  the  construc¬ 
tion  of  large  scale  systems  from  these  single-chip  comput¬ 
ers?  such  a  system  is  currently  being  designed  at  the 
University  of  California  at  Berkeley.  Because  of  the  topol¬ 
ogy  or'  the  system?  it  was  named  X-tree. 

A  basic  description  of  X-tree  and  its  concepts  is  pro¬ 
vided  in  section  2  of  this  paper.  This  account  is  neces¬ 
sarily  very  brief  and  omits  details  not  needed  for  the 

understanding  of  this  particular  paper.  A  more  complete 

2 

treatment  can  be  found  in  C2?3?83.  — 

The  goal  of  the  work  reported  herein  was  to  define  -and 
implement  an  operating  system  kernel  C101  for  X-tree.  "At 
the  time  this  project  was  started?  almost  all  the  work  done 
on  X-tree  had  been  on  the  design  of  the  hardware  architec¬ 
ture  and  the  communications  network.  Very  little  work  oh 
software  had  been  done.  In  fact?  this  has  been  the  first 
real  attempt  to  produce  running  software  for  the  system.-;  - 
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ection  3  or'  this  psf-er  describes  the 


definition  and 


internal  structure  of  the  node  kernel.  In  section  4>  the 
various  techniques  used  to  test  end  run  the  kernel  on  the 
UNIX  system  are  discussed.  Finally  some  reflections  end 
closing  comments  ere  presented  in  section  5* 


if 
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Basic  Bcscrieiioo 


X-tree  is  s  connected  network  of  sinsle— chip  computers* 
Results  of  various  simulation  studies  C3D  have  led  to  the 
selection  of  a  topology  which  is  s  binary  tree  with  some 
additional  interconnections  at  each  level  of  the  tree*  Many 
different  schemes  for  these  additional  interconnections  have 
been  proposed.  Mo  final  selection  has  yet  been  made.  Pic¬ 
tured  below  is  an  X-tree  with  "full  rind1  irrtralevel  connec¬ 
tions  ■  ‘ 


!  2=1  Co! - i  3=1  lb! -- 


—  !  4=1 00b  ! - !  5=1 01b  !  - !  6=1 10b  ! - ------ 1 7=1-1  lb  i< 


Each  node  in  the  tree  is  a  full  scale  computer  wiib 
on-chi.-  memory  3nd  communication  hsro’wsret  All  external 
devices  (disks?  terminals?  etc)  for  the  system  are  connected 
to  the  downward  links  at  the  'leaves*  of  the  tree*  There, 
are  no  external  devices  directly  connected  to  the  upper 
nodes?  except  a  bootstrap-  device  connected  to  the  root  nod?* 


4 


s  nodes  in  the  tree  are  numbered  us ins  the  simple 
illustrated  in  the  diagram.  The  relationship  between 
hers  of  connected  nodes  has  s  convenient  property  ( in 
ary  representation) i 

s  number  of  a  left  child  is  the  number  of  the  parent 
th  an  additional  0  at  the  least  significant  end* 

e  number  of  a  right  child  is  the  number  of  the  parent 
th  an  additional  1  at  the  least  significant  end* 

has  any  particular  node  easy  to  locate. 


Hsrrh  30# 


1  0/0 


t*  ''i‘T  (WHW  «WWpw*  .  IWW*  I"I)'V  *fiw> 
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(2)  An  identifier  (ID)  which  is  unioue  within  that  leaf 
node?  dens  rated  bs  the  leaf  node  when  the  object  is~ 
crested, 

A  full  global  address  is  then  specified  by  an  object  NAME 
and  an  offset  within  the  object. 

NAME  -  NODE  +  ID 

ADDRESS  =  NAME  +  OFFSET  =  NODE  +  ID  +  OFFSET 

In  this  way?  no  global  table  is  needed  to  locate  any 
global  address.  A  request  for  data  is  routed  to  the  leaf 
node  specified  in  the  object  NAME.  That  leaf  node  then  maps 
the  ID  ,nd  offset  into  a  location  on  the  memory  device  and 
retrieves  the  reeui red  page. 
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3.  DascuxEtioo  of  the  tJcde  Kegoel 


3.1.  Eunciiooai  Deacriatioo 

The  •  rimary  goal  of  the  nods  kernel  is  to  make  possible 
the  construction  of  the  operating  system  and  user  programs 
ss  sets  of  communicating  concurrent  processes  Cll*  In  par¬ 
ticular  >  these  processes  must  be  able  to  communicate  with 
each  other  using  only  process  names?  not  locations  within 
the  tree. 


The  services  provided  by  the  kernel  are  necessarily 
Quite  primitive.  They  are  concerned  only  with  the  creation 
and  execution  of  processes  3nd  with  the  sending  of  messages 
from  one  process  to  another.  The  operating  system  layer 
constructed  over  the  kernel  will  provide  more  general  ser¬ 
vices  for  the  user. 

The  relationship  between  the  kernel  and  the  upper  level: 
operating  system  can  3lso  be  described  in  terms  of  resources 
managed  I 

1.  The  kernel  manages  node  CF'U  time  snd  inter-node  commun¬ 


ications*  -  -  -  -  *  -  ’<^y 

The  general  operating  system  manages  physical  devrces- 
(disks?  etc)  and  processor  assignments. 


To  accomplish  this?  the  kernel  is  implemented  as 
small  set  of  code  which  exists  in  each  node  of‘  the. treer 
must  perform  three  specific  tasks?  ,  ~ 


a 
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(1)  Manage  the  execution  cf  processes  running  on  that  node? 

<2)  Manage  the  interface  to  the  communications  controller)' 
and 

(3)  Identify  messages  to  send  them  to  their  proper  destina¬ 
tions  * 

In  the  discussions  which  follow*  no  distinction  is  made 
between  processes  which  are  part  of  the  operating  system  and 
processes  which  perform  sppI ications .  For  the  purposes  of 
>,he  Kernel*  all  such  processes  are  considered  as  user  1 
processes . 


<  i-' 
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3.2*  S±.cuc:tu.E3l  Overview 

As  pointed  out  above?  the  kernel  is  a  set  of  software 
which  resides  in  each  node  end  supervises  the  user  processes 
end  common icst ion  for  that  node*  It  is  composed  of  severs! 
P3rts ; 

1.  Independent  kernel  processes ? 

2.  A  set  of  device  drivers  for  the  communications  channel? 

3*  A  set  of  kernel  service  routines  invoked  by  users 
processes?  and  -  - 


various  subroutines  for  the  performance  of  specific-: 
tasks . 


i hese  processes  and  routines  can  be  grouped  into  three 

--  *  '  *r'*"ar 

blocks  corresponding  to  the  main  tasks  of  the  kernel i 


Process  Managemert!  subroutines  to  create  and  start 
user  processes?  kernel  service  routines  to  be  invoked 
by  user  processes. 

Communications  Management!  a  set  of  device- drivers ?  one 
for  each  'slot'?  wh.vch  move  messages  in  and;  out  of  .the 
node . 


3.  Message  Management!  the  main  process  of  the  kernel  and 
various  subroutines  for  manipulating  messages?  matches 
user  messages  to  their  destinations  and  interprets  ker¬ 
nel  messages  to  perform  remote  reouests*  -- -;-P 
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While  these  ere  convenient  gr our-s  for  understanding  the 
structure  of  the  kernel?  it  is  important  to  realise  that 
they  are  not  independent  of  each  other.  For  example?  when 
kernel  service  routines  are  called  by  the  user?  they  accom¬ 
plish  their  Job  by  in  turn  calling  routines  which  are  part 
of  the  message  management  group. 


Communication  between  the  three  groups  is  accomplished 
in  primarily  two  ways! 


1.  A  process  or  routine  in  one  group  may  call  a  routine  in 
another  group.  Of  course?  data  may  be  passed  both  ways 
as  arguments  and  return  values. 


Certain  processes  obtain  their  work  from  FIFO  Queues  of 
■work  packets'.  Other  processes  place  packets  in  these 
Queues  for  later  processing.  •  ;■ 


The  processes  of  the  kernel  (and  of  the  users  as  well) 
are  to  be  executed  by  a  hardware  monitor  suggested  by  Tim- 


nc^r-ssr 


'•3 .  Their  implementation  in  the  emulator  is 


described  in  chapter  4. 


The  kernel  implemented  for  this  project  has  ignored 
several  sources  of  complexity  in  the  X-tree.  First?  it  is 
envisioned  that  the  X-tree  system  is  to  be  indefinitely 
extensible?  hence  node  numbers  may  have  an  unbounded' 
length.  In  order  to  focus  on  other  issues?  a  fixed  length- 


node  number  of  16  bits  was  postulated  for  the  kernel.  This 
is  not  really  a  severe  restriction  since  this  length  wi*l*l 
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support  a  tree  of  16  levels*  s  total  of  65,535  nodes! 

The  second  simplification  is  far  more  serious?  the  pss- 
ins  of  X-tres  is  not  emulated.  Instead?  all  code  and  data 
structures  for  user  processes  are  included  directly  in  the 
emulated  node  that  the  process  is  running  in.-  The  addition 
of  paging  to  the  kernel  at  a  later  date  is  not  absolutely 
straightforward,  but  an  attempt  was  made  to  keep  jthe  code 
compatible  with  paging  and  the  use  of  full  global  addresses. 

■r 

A  model  of  user  processes  under  the  fully  paged  system  is 
described  in  appendix  £. 

It  is  also  assumed  that  all  messages  in  X-tree  are 
reliably  transmitted.  Later  versions  of  this  kernel  will 
need  to  handle  message  communication  failures. 
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3.3,  Process  Macissemeni 

The  Process  Manager  consists  of  all  the  code  necessary 
to  support  user  processes.  In  particular?  the  following 
parts  are  included! 

1-  Two  routines  for  starting  and  creating  user  processes. 

2-  A  set  of  kernel  service  routines  called  by  user  erb- 
g  rams . 

Each  user  process  is  represented  by  a  process  control 
block  <PCB>  of  the  form 


— >used  for  the  FREEPCB 
linked  list 

— .’^-identifies  the  process  globally 


— Mnternal  process  status?  such  -as 
condition  codes?  etc 
— >state  of  the  process!  IN-USE?  - 
UhITING-FOR-IO?  etc 
— >used  for  emulator  synchronization 


— >pointer  to  the  process's  list  of 
active  lOBs 


A  fixed  number  of  these  blocks  exist  in  each  node. 
This  sets  an  effective  upper  limit  on  the  nuinbeT  of 
processes  active  in  node  at  one  time. 


Each  process  can  be  identified  in  two  ways!' 


link  field 


full  process  name 


priority 


status  word  (psw> 


external  status  word 


semaphore  pointer 


rntt  i  c  ~  l  — .•  -  x  _ 

J.  =_*  1*  X  X  w  Kf  A  I  I  Vr  S  s 


name  of  parent 


program  counter 


stack  pointer 
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(1)  By  s  full  32-bit  name  which  is  composed  of 


(a>  the  node  number  of  the  node  that  the  process's 
work  space  is  on*  and 

(b>  an  ID  number  Guaranteed  to  be  unioue  within  that 
node . 

(2)  By  s  pointer  ( PIID  to  its  PCB?  this  is  used  only  within 

the  local  node  kernel. 

Translation  from  the  local  name  to  the  full  name  is 
simple*  it  can  be  retrieved  from  the  second  field  in  the 
PCB.  The  other  direction  is  a  little  more  complicated.  A 
hash  table  is  maintained  to  hold  pointers  to  all  sctiv.e  PCBs 
in  the  node ,  ~  ~ 

For  the  emulation?  an  additional  set  of  kernel  sub¬ 
processes  is  defined J  one  for  each  PCB  in  the  node.  Each 
can  be  thought  of  as  3  subprocess  reserved  to  execute  a 
user.  At  system  start  time?  each  of  these  subprocesses  -goes 
immediately  to  sleep.  Also?  all  of  the  F’CBs  are  linked 
together  in  a  list  of  unused  PCBs  (the  FREEPCB  Queue). 

The  creation  of  a  new  user  process  is  performed  by  the 
subroutine  KCRTUSR.  Its  implementation  is  straightforward. 

An  unused  PCB  is  obtained  from  the  FREEPCB  c?ueue  and  is 
filled  in  with  the  appropriate  information?  specifically  the 
priority?  starting  program  counter?  3nd  the  name  of  the 
parent  process.  A  unioue  full  name  for  the  new  process  is 
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then  gene rated .  stored  in  the  PCB  end 
passed  back  to  the  parent  process  for 


the  hash_  table* 
later  use* 


The  subroutine  KSTRTUSR  starts  a  specified  user  process 
and  is  eaualls*  simple.  Using  the  hash  table*  the  process  PCB 
is  located  and  the  corresponding  kernel  subprocess  (the  one 
dedicated  to  that  PCB)  is  awakened.  This  subprocess  then 

(a)  Runs  the  user  to  completion* 

(b)  Removes  the  process  name  from  the  hash  table* 

to)  Puts  the  PCB  back  on  the  FREEPCB  Queue  for  later  reuse* 
3  no 

<d)  Goes  back  to  sleep. 

Kernel  Service  Routines  -  Five  functions  are  provided 
in  the  present  kernel  as  service  calls  for  the  user*  These 
are  described  below. 


PCREATE  (NODE*  START,  PRIO*  PNAME)  '  '  ~- 

Definition:  A  child  process  is  created*  The  arguments 
NODE*  START*  and  PRIO  specify  the  node  number*  starting 
global  address  and  priority  of  the  new  process.  Once 
the  new  process  has  been  created*  its  full  process  name 
will  b=  returned  to  the  caller  in  PNAME. 

Implementation j  If  NODE  is  the  current  node  number* 
routine  KCRTUSR  can  be  called  directly*  If  not* 
a  message  is  formed  and  sent  to  the  kernel  process 
on  the  proper  node.  The  return  message*  which 
contains  the  new  process  name*  is  then  waited  for* 


PSTART  (PNAME) 

Definition:  Starts  the  execution  of  process  PNAME* 
where  PNAME  is  a  full  process  name. 

Implementation:  Similar  to  PCREATE  except  of  course 
that  KSTRTUSR  is  called  instead  of  KCRTUSR. 
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NPARENT  <  PNAME ) 

Definition!  Returns  in  PNAME  the  full  process  name 
of  the  parent  of  the  celling  process. 
Implementation!  Copies  it  from  the  proper  field 
in  the  celling  user's  PCB. 


MSEND  ( PNAME  t  IONBR,  LNG,  MSGADDR) 

Definition!  Sends  s  message  to  process  PNAME?  the 
message  is  taken  from  3  buffer  of  length  LN3 
starting  et  address  MSGADDR,  The  IONBR  is  en 
integer  which  mstches  the  IONBR  in  the  MRECV 
request  by  the  target  process. 

Implementation !  See  section  3, A  (Message  Manager) 


C 


MRECV  C  IONBR-  LNG t  MSGADDR) 

Definition!  Sets  up  to  receive  a  message  of 

maximum  length  LNG  starting  at  address  MSGADDR, 
The  calling  process  waits  until  the  message 
is  received.  Upon  return?  the  LNG  field  will 
contain  the  actual  length  of  the  message 
received. 

Implementation!  See  section  3,4  (Message  Manager) 
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3.4.  Message  Management 


The  message  manager  is  the  heart  of  the  kernel.  It  is 
comprised  of  the  main  kernel  process  and  a  set  of  subrou¬ 
tines  for  manipulating  messages. 

Each  active  message  in  the  node  is  represented  and 
managed  using  an  I/O  Control  Block  (IOB)  with  the  following 
format. 
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This  block  contains  all  th 


— >used  to  link  IOB  into  Queues 

— /link  for  a  process's  list  of 
active  IGBs 


— >used  to  match  incoming  messages 
with  the  proper  MRECV  reauest 
— >ststus  of  ID*  IN  vs  OUT. 

and  WAITING .  BUSY.or  COMPLETE 
— /encodes  what  msg  manager  should 
do  when  I/O  completes 
— >=  NULL  if  no  dynamic  buffer  used 


— >see  communication  manager 

e  information  needed  to  keep  track 


of  and  transfer  3  message. 


The  use  of  the  IOB  3nd  the  operation  of  the  message 
manager  is  best  shown  by  tracing  the  path  of  a  message  from 
one  user  process  to  another. 
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msS  mngrnt 
process 


!  wake 


Kernel 

service 


ACTIONS 


!  output. 


o  r i ve i 


The  above  diagram  illustrates  the  control  flow  for 
messsse  in  the  sending  node.  Step-by-step * 


The  kernel  csll  routine?  acting  on  behalf  of  the  user? 
uses  calls  to  message  management  subroutines  to  obtain 
and  fill  in  a  new  IOB. 


2  -  Once  the  JOB  is  filled  in?  if  is  placed  on  the  IQOUTG? 

3  FIFO  oueue  of  output  messages  waiting  to  be  transmitted 
The  kernel  routine  then  puts  the  user  to  sleep. 


--  The  message  is  picked  up  by  a  device  driver  tsee 

section  3.5?  communications  management)  and  transmitted* 
The  IOB  is  then  placed  in  ACTIONS?  the  main  kernel 
process's  Queue  of  work  to  be  done. 


The  main  i-  ernel  process  retrieves  the  ICB  from  the  ACTION 
and?  guided  by  the  ACTION  CODE  filled  in  by  step  1?  wakes 
up  the  user. 


The  user  process  wakes  up?  still  executing  in  the  kernel 
service  routine.  This  routine  retrieves  any  status  it 
needs  from  the  IOB?  then  releases  the  IQB  and  returns  to 
the  user. 
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when  3  NistssSe  coses  into  the  nods*  it  has  a  header 
which  contains  the  ts«  Set  process  name  and  an  10  nuabei 
< sec  section  3.5 *  communications  management'*  The 
routine  FINBluB  uses  the  hash  table  to  find  the  target 
process's  PCS  and  then  the  I OB  with  the  Batenins  10 


number.  After  the  message  transmission 


a  is  7 


mished* 


the  I0B 


"laced  in  ACTIONS ?  the  sain  kernel  process's 


C?'J“U0  or  WO  I'K. 


oe  done ♦ 


The  main  kernel  process  retrieves  the  I0B  from  the  ACTTOKQ 
and*  guided  be  the  ACTION  CODE  filled  in  be*  step  1?  wakes 
up  the  user. 


< 


5  -  The  user  process  wakes  up?  still  executing  in  the  kernel 
service  routine-  This  routine  retrieves  ana  status  it 
needs  from  the  JOB*  then  releases  the  I0B  and  returns  to 
‘he  user. 

In  addition  to  looking  at  completed  XOB's  to  see  what 
completion  actions  sust  be  taken*  the  asin  kernel  process 
performs  another  function.  It  is  the  target  process  for  any 
remote  service  reouests  from  other  nodes*  Uhen  a  message 
containing  auch  a  reouest  is  received?  the  actions  needed  to 
satisfy  the  reouest  are  done.  A  reels  message  which  con¬ 
tains  return  information  is  then  prepared  and  sent*  -  - 

Two  remote  reouests  are  provided*  create  new  process 
arid  start  process.  The  formats  of  the  message  bodies  are*  - 

htnu i c.  i-iiti. it 

;  reouest  I  process  !  process  !  10  no r  to  1 

d - !  code  l  priority  I  start  I  reply  to  I 

1=11  ;  3do  ress  I  I 
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REPLY  TO  REMOTE  CREATE 


!  reouest 

full  name  ! 

1  code 

of  proc  ! 

i  =  -I 

created  ! 

REMOTE  START 


i  reouest 

full 

i  IC  nbr  to  I 

!  code 

process 

!  reply  to  ! 

j  —  0 

name 

i  t 

REPLY  TO  REMOTE  START 


reouest 

code 


Of  course  use  is  3 Iso  made  of  information  in  the  message 
header?  so  thst  dets  is  not  duplicated  in  the  message  body. 


The  resulting  (simplified)  cycle  for  the  main  kernel 
process  is 


Do  forever 

Get  I OB  from  action  Queue 
If  it's  s  remote  reouest 

Do  the  reauested  action 
Reuse  the  I OB  to  send  reply 

else 

Do  comp a et ion  set ions  encoded  in  IOB 
(swsken  user?  relesse  IOB?  etc) 


THs  routines  to  obtain  and  release  lOBs  and  dynamic 
message  buffers  are  3lso  considered  part  of  the  message 
manager.  These  routines  are  straightforward  and  will  not  be 
discussed  here. 
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3,5,  ComcbUDj.C2tJ.orjs  idanas’smeQi. 


As  pointed  out  in  section  2.2*  the  controller  appears 
to  the  CF'U  ss  3  number  of  independent  devices  cslled 
’slots*.  Each  slot  is  controlled  through  3  section-,  of 
memory  defined  3S 


commsrid  !  statu 


current 
t  ransf  er 
3ddress 


current 

bate 

count 


interrupt 

address 


Currently,,  the  two  bits  READY  and  TERMINATE  sre  used  in  the 
command  bate.  Similarly*  the  ERROR*  END-OF-BUFFER*  snd 
END-OF-MESS.AGE  bits  3 re  defined  in  the  status  bate* 


There  are  two  sets  of  slots 5  one  set  for  input  3nd  one 
set  for  output.  These  slots  'Perste  in  3  ’direct  memory 
access’  fashion.  When  the  READY  command  bit  is  on*  the  slot 
transfers  data  into  or  out  of  the  buffer  defined  ba  the 
TRANSFER  ADDRESS  field.  For  each  bate  transferred*  the 
TRANSFER  ADDRESS  is  incremented  and  the  BYTE  COUNT  is  -decre¬ 
mented  . 


Several  conditions  cause  the  transfer  operation  to 


;topJ 


1.  An  error  is  encountered, 


2.  The  BYTE  COUNT  reaches  zero  (END-OF-BUFFER), 


The  end  of  message  is  found  (input  only). 
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whan  one  of  these  occurs?  the  fcllowing  steps  are  performed 
by  the  controller* 

1.  The  command  byte  is  cleared*  7' 

2*  The  appropriate  status  byte  is  turned  on* 

3.  An  interrupt  is  .Generated* 

4,  The  slot  waits  until  a  new  command  bit  is  turned  bn* 

Since  a  single  message  may  be  composed  of  multiple 
buffers  in  local  memory?  it  will  need  to  be  transmitted,  in 
several  parts.  Conseeuently ?  the  communications-  channel 
must  be  tcld  explicitly  where  one  message  ends  and  the  next 

t 

begins*  This  is  the  function  of  the  TERMINATE  command  oh  ah 
output  slot?  it  causes  the  termination  of  the  current  mes¬ 
sage*  The  next  transfer  operation  is  then  interpreted  as 
the  beginning  of  a  new  message.  On  an  input  slot?  the  ‘TER¬ 
MINATE  command  causes  the  remainder  of  the  current  *  message 
to  be  flushed.  ^ 

Any  message  has  the  standard-  format  •  — 


message 

header 


message 

body 


and  a  message  heeder  looks  like 


target 

10 

source 

source 

process 

id 

node 

process 

name 

number 

numbs  r 

name 
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Of  courser  the  "target  node  number1  field  is  used  by  the 
communication  hardware  to  route  the  message* 

The  slots  are  controlled  by  a  set  of  drivers  (implex 
merited  3s  processes)  r  one  for  each  slot.  The  output  drivers 
are  identical?  sharing  the  same  code  and  having  separate 
local  d3ta  stacks*  The  same  is  true  of  the  input  drivers* 

The  output  drivers  take  their  work  off  3  common  FIFO 
Queue  of  messages  (more  strictly?  IOBs)  waiting  to  be  sent 
out.  The  header  and  body  for  a  message  are  transmitted  as 
separate  buffers.  So  the  logic  for  an  output  drive  is* 


Do  forever 

Get  a  message  f  rom  the  output  Queue 

Set  up  slot  to  output  header 

issue  READY  command  and  W3it  for  interrupt 

Set  up  slot  to  output  message  body 

Issue  READY  command  and  wait  for  interrupt 

Issue  TERMINATE  command  and  wait  for  interrupt*. 

Send  I OB  to  message  manager 


The  logic  Tor  an  input  driver  is  a  little  more  compli¬ 
cated*  The  message  header  must  be  read  in  before  themssn 
sage  can  be  identified  and  the  proper  input  buffer  can  be 
selected.  Conseouently ?  a  dedicated  header  area  is  reserved 
fo  each  input  slot.  After  the  header  is  read  in?-  >ihe- 
correct  IOB  to  receive  the  message  must  be  identified  (see 
section  3.4  on  the  message  manager).  Once  this  is  done?  the 
rest  is  straightforward}  -** 
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Do  forever 

set  up  slot  to  reed  into  header  sree 
end  issue  READY  commend 
wait  for  interrupt 
locate  matching  IOB 

<or  create  one  if  target  is  kernel) 
copy  header  into  IOB 

set  up  slot  to  read  into  message  buffer 
and  issue  READY  command 
wait  for  interrupt 
send  IOB  to  message  manager 


After  both  input  and  output  operations?  the  message  IOB 
is  sent  back  to  the  message  handler  for  any  closing  actions 
that  may  be  required*  — - 


;  C 
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4.  IUE  EaULalluN  ESOGEAM 

4*1*  X-Iree  Nodes  as  UNIX  Erocesses 

In  order  to  test  and  debud  the  node  kernel *  some  method 
of  running  it  on  existing  hardware  is  needed*  At  this 
university*  the  most  convenient  way  to  do  this  is  to  run  an 
of  emulation  on  the  UNIX  system*  However*  the  emulation 
cannot  be  done  as  a  simple  UNIX  prodram*  Two  levels  of 
parallel  processirid  are  needed! 

(1)  The  X-tree  .  ias  more  than  one  node.  A  way  is  needed  to 
model  the  simultaneous  execution  of  the  X-nodes  and  the 
communication  between  them. 

(2)  The  kernel  of  the  X-node*  todether  with  the  user 
processes  running  on  that  node*  represent  a  set  of  con¬ 
current  subprocesses  executing  within  the  node. 

The  two  types  of  parallelism  are  different  in  one  very 
important  respect!  interprocess  communication*  The  X-nodes 
have  no  common  local  memory  through  which  to  communicate* 
The  kernel  subprocesses  *  on  the  other  hand*  rely  heavily  on 
memory  structures  that  are  in  the  local  node  and  are  avail¬ 
able  to  all  subprocesses  at  that  node. 

This  observation  leads  to  a  relatively  straightforward 
emulation  technique.  Each  X-node  is  modeled  by  a  UNIX  pro¬ 
cess  and  they  will  transmit  messages  using  UNIX  'pipes'* 
Modeling  of  the  kernel  subpiocesses  within  the  node  will  be 
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discussed  i r-  section  II  IB. 

It  would  be  possible  to  set  up  the  pipes  between  the 
UNIX  processes  so  thst  they  form  3  binary  tree  analogous  to 
the  X-tree  topology.  However?  the  point  of  this  exercise  is 
to  emulate  the  runnins  of  the  X-node  kernel?  not  the  commun¬ 
ications  links.  An  alternative?  which  greatly  reduces  the 
amount  of  interprocess  data  flow?  is  to  define  a  single  pipe 
into  each  process.  Any  message  is  sent  from  one  process  to 
another  by  simply  writing  into  the  input  pipe  for  the  target 
r.ode.  The  emulation  network  topology  is  then  fully  con¬ 
nected  and  looks  like; 


- -->pipe  to  node  1 

- >pipe  to  node  2 

♦ 

♦ 

- >pipe  to  node  n 


- >pipe  to  node  1 

- >pipe  to  node  2 

+ 

* 

- >pipe  to  node  n 


The  simple  use  of  pipes  does  not  solve  all  of  the  com¬ 
munications  problems.  Each  node  should  be  continuously- 
working  on  the  Jobs  it  has  to  do?  and  suspend  such  work  only 

when  there  is  a  message  on  the  pipe  to  be  read  in.  Unfor- 

# 

tunetely?  a  read  operation  on  an  empty  UNIX  pipe  suspends 
the  entire  process  until  some  data  is  placed  in  the  pipe  by 
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anotner  process*  Conseauantlv »  some  other  mechanism  is 

resui red* 


This  problem  can  be  solved  in  part  through  the  use  of 

UNIX  'kill'  signals  (see  appendix).  The  name  'kill'  is 

* 

somewhat  misleading  since  these  signals  can  be  'caught'  by 
the  process  receiving  the  signal.  To  accomplish  this?  the 
process  to  be  signalled  performs  the  system  call 


int  catch  ()» 

signal  ( signal -number >  catch)? 
and  continues  to  execute  its  other  tasks. 

Then?  when  another  process  sends  that  signal-number  to 
the  process  with  the  system  call 

kill  (process-number,  signal -number ) . 
the  function  catchO  will  be  asynchronously  called  in  the 
signalled  process.  This  call  is  like  any  other  in  that  oh 
exit  from  catchO.  the  signalled  process  will  continue  with 
the  code  that  was  being  executed  when  the  'k*ll'  signal  was 
received. 


One  more  aspect  of  catching  signals 
Once  a  signal  is  caught*  the  'signal' 
re-issued  to  catch  the  next  signal.  Our 
+em  code  then  looks  like 


is  very  important, 
system  call  must  be 
communication  sys- 


: 

§ 

1 
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m  r 


to  send  message; 

write  into  Pipe 

send  signal  (system  csll  'kill') 

to  receive  message; 

initialise  with 

"signal  < signal -number?  catch) " 

♦  *  ♦ 

function  CATCH; 

awaken  subprocess  to  reed  pipe 
return 

♦  ♦  * 

subprocess  INPUT; 
dc  forever 

read  and  process  message 
re-issue  'signal*  system  call 
So  back  to  sleep 


This  is  still  not  the  final  solution.  Since  3ny  number 
of  Processes  can  write  into  the  pipe  at  virtually  the  same 

instant.*  a  dangerous  condition  occurs.  If  two  processes  send 

* 

messages  into  the  pipe  in  ouick  succession*  the  receiving 
process  will  not  be  able  to  re-issue  the  signal  'catch* 
defore  the  second  signal  arrives.  If  this  happens?  UNIX 
will  3bort  the  receiving  process,  which  is  clearly  unaccept- 
abl  e . 


To  solve  this  problem,  the  X-tree  node  is  split  into 
two  UNIX  processes.  The  primary  one.  called  PROC.  does  most* 
of  the  work  and  operates  as  before.  However.  other 
processes  dc  not  write  directly  into  its  pipe.  Instead,  an 
intervening  Process  (called  COMM)  receives  messages  from 
other  nodes  and  sends  them  to  the  PROC  process  one  at  a  time 
using  a  simple  hand-shaking  protocol; 


srch  30. 


>  PROC  sets  to  catch  signal  from  COMM 
F'ROC  signals  COMM  that  it  is  ready 
for  message 

COMM  sets  message  from  its  pipe 
COMM  puts  message  into  private  pipe  to  PROC 
and  sends  signal  to  PROC 
PROC  receives  signal  from  COMM  then 
reads  and  processes  message 


Nodes  which  send  messages  are  now  relieved  of  the 
necessity  .  to  send  signals  after  writing  a  message  into  the 
pipe  for  another  node.  The  COMM  process  can  afford  to  wait 
when  it  reads  an  empty  pipe. 


The  resulting  final  topology  is 
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■  — pipe — -> 
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n 

- >pipe 
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to 
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> 
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i 

♦  * 

- >pipe 

to 

COMM  h 

r83Qb* 

signal 

Only  one  small  detail  still  needs  attention#  It  is-  net 
guaranteed  that  data  from  a  pipe  will  be  read  out  in  the 
same  units  that  they  were  written  in.  Messages  will  not  be 
shuffled#  but  they  may  be  received  more  than  one  at  a  tiifte* 
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When  3  Pipe  is  ready  all  the  messages  currently  in  the  pipe 
are  returned  by  that  single  read*  Hence*  we  must  reserve  a 
special  message  separator  byte.  The  COMM  process  searches 
for  this  separator  byte  and  splits  off  single  messages  to  be 
sent  to  the  F'ROC  process.  This*  of  course  will  not  be 
necessary  in  X-tree  itself*  since  the  end-of-messaSe  will  be 
transmitted  using  an  extra  bit* 
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4.2.  Suherocesses  and  Ibeir  SaDcbsonizaiioD 

The  use  of  multiple  UNIX  processes  is  s  satisfactory 
was  to  emulate  the  concurrent  operation  of  the  X-tree  nodes. 
Howeverr  this  teehnioue  will  not  work  for  modeling  the 
coroutines  running  on  a  single  node.  Separate  UNIX 
processes  do  not  share  any  global  address  space?  the  corou¬ 
tines  in  a  node  rely  heavily  on  such  a  global  data  space  for 
communication.  Implementing  such  a  data  space  as  a  disk 
file  would  be  both  awkward  and  ineff ieient. 

This  difficulty  implies  that  multiple  suhprocesses  must 
somehow  be  'faked"  within  a  single  UNIX  process.  The  UNIX 
system  offers  no  direct  solution  to  this  problem.  For¬ 
tunately  *  however*  software  to  do  Just  that  was  developed  as 
part  of  the  Toy  Operating  System  C7D  used  at  U  C  Berkeley  to 
teach  operating  system  principles.  This  code  has-been 
copied  and  used  in  the  X-tree  kernel  emulator.  The  package 
is  briefly  described  below. 

A  subprocess  is  defined  as  an  independent  execution 
stream  together  with  its  own  separate  local  data  stack.  All 
external  data  structures  and  variables  are  shared  among  all 
the  subprocesses.  To  implement  this  concept*  a  series  of 
data  stack  storage  areas  are  allocated*  one  for  each  subpro¬ 
cess.  At  any  moment*  the  data  stack  for  the  currently 
active  subprocess  is  found  in  the  normal  st.3ek  location 
(growing  down  from  address  0177777),  To  activate  a  dif ^ 
fereht  subprocess*  the  date  stack  for  the  outdoing; 
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subprocess  is  copied  into  its  stack  save  area*  The  data 
stack  for  the  new  sub-process  is  is  then  copied  in  from  its 
save  ares. 
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Step  2; 
load  new 
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Step  1 1 
save  old 
stack 


!  !  standard  !  ! 

It  it 

if  It 

- - >  •:  03  i 

*  » 

i  * 

It  is  important  to  note  that  the  on  Is  private  storas'e 
belonsms  tc  s  sub-process  is  in  its  local  data  stack.  All 

or  slobal  variables  are  shared  between 
sd  synchronization  must  be  handled  sxpI i- 

are  created  in  a  manner  analogous  to  the 
recesses.  The  call 


its 

spfork  < priority) 
ocstes  a  new  data  stack  save 


March  30 f 


stack 


old  one 


eh  (SKSCut 


rk  function 


nd  the  ps rent  receives 


iOnSeeucfivis  n'E 


sssaSni 


•3:  ==  0)  chil« 


rreste  s  child  suberocess  which  will  enter  the  function 


The  parent  sufcp recess  will  continue  with  the  next 


or  cooe. 


Execution  snd  synchronisation  of  subprocesses  is  con¬ 
trolled  thouSh  the  use  of  semaphores? with  the  P  and  V  opera¬ 
tions  defined  by  DiJkstra  C43.  These  operations  sre  well 
known  snd  will  not  be  described  here.  In  this  icple&ents- 
tion?  semaphores  are  represented  by  s  value  and  a  linked 
Queue  of  subprocesses  which  ere  waiting  for  the  sessrnore* 


Sen? spheres  ere  used  by  the  subprocesses  of  the  kernel 
emulator  in  basically  two  styles.  In  the  first  style*  3 
subprocess  one  rates  on  viork  packets  taken  from  a  FIFO  ausue* 
The  basic  cycle  for  such  a  suberoeess  is 


loop  forever  ■  -  - 

set  a  work  packet  from  sueoe 
process  work  packet 

of  the  operation  "set  work  packet"?  3  #P"  operation 


rforasc  on 


eleaents  in  the 


semaphore  which  contains  the  nuiber  of 


1''  the  Queue  is  empty?  the  suorro^ 


I  v”  »  i_  I  * 
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cess  then  waits  until  another  subprocess  puts  an  element 
into  the  Queue.  The  slot  output  drivers?  for  example? 
operate  this  way.  They  take  their  work  from  s  Queue ( IOOUTO) 
of  I/O  control  blocks  for  messages  which  are  weitins  to  be 


sent  out. 


The  second  style  of  "se  is  generally  called  private 
semaphores.  A  private  semaphore  is  pernanently  assigned  to 
a  particular  subprocess.  When  the  subprocess  must  wait  for 
some  reason?  it  does  a  P  operation  on  its  private  semaphore. 
It  then  stays  asleep  until  some  other  subprocess  wakes  it  up 
by  doing  a  V  on  its  (the  sleeping  subprocess's)  semaphore. 


!  ( 


One  example  of  this  style  of  semaphore  use  is  the  pro¬ 
cess  which  reads  messages  from  the  pips.  It  is  called  INP- 


OIPP-  which  stands  for  J NPut  Dispatcher,  It  waits  on  a 
private  semaphore  called  IOINT-  The  signal  from  the  COMM 
process  forces  the  execution  of  the  catch  routine  which  per¬ 
forms  a  V  on  JOINT?  thus  wak ins  the  JOINT  subprocess. 


The  subprocesses  for  the  kernel  and  hardware  emulation 
were  written  'on  top  of'  these  synchronisation  primitives* 
In  order  to  keep  the  kernel  cods  separate  from  the  subf=ro~ 
cess  3  implementation  code?  almost  no  changes  were  m3dt  in 
copying  the  subprocess  package  from  the  Toy  Operating  8ys~ 


One  change  was.  needed?  however?  because  the  P  and  V 
operations  must  be  indivisible  in  order  to  be  valid.  In  the 
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Toy  Operating  System ?  there  is  no  possibility 
asynchronous  interrupt  from  outside  the  program 
ciai  measures  have  to  be  taken.  This  is  not  the 


of  a  real 
so  no  see- 
csse  in  the 


X-tree  hernei  emulator.  The  COMM  process  may  send  the  'mes¬ 
sage  ir,  pipe'  interrupt  at  any  time.  As  described  above ? 
this  will  cause  an  immediate  call  to  a  subroutine  which 
wakes  up  the  INTDISP  process*  If  this  interrupt  arrives 
while  a  low  priority  subprocess  is  doing  a  P  or  V  operation? 
that  operation  would  be  suspended  in  the  middle  arid  an  error 


may  result. 

To  keep  this  from  happening?  it  is  desirable  to  disable 
the  interrupts  at  the  start  of  a  P  or  V  operation  and  reen¬ 
able  interrupts  at  the  e no  of  the  interrupt*  The  overhead 
of  actually  disabling  the  interrupt  would  be  unacceptably 
high?  since  it  would  involve  a  complicated  signal  protocol 
between  the  PPOC  and  0SMM  processes.  Instead?  the  interrupt 
is  received.-  but  the  awakening  of  the  INTDISP  subrrocess  is 


postponed . 


At  entry  to  a  P  or  V  operation?  a  flag  is  set  to  indi¬ 
cate  that  interrupts  are  disabled.  The  CATCH  routine  must 
check  this  flas  when  an  interrupt  occurs?  if  disabled?  the 
interrupt  must  be  noted?  but  INTDISP  cannot  be  awakened. 
Then*  oust  before  exit  from  the  P  or  v  operation?  the  inter¬ 
rupts  are  reenabled  and?  if  an  interrupt  occurred  in  the, 
meantime*  INTDISP  must  be  awakened. 

The  logic  for  this  then  looks  like 
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suhproeess  INTDISPJ 
loop  forever 

set  to  catch  interrupt  with  CATCh’O. 
signal  COMM  ready  for  interrupt 
wait  bs  P(IOINT) 
read  and  process  message 

interrupt  routine  CATCH? 

if  (intptdis)  intPtreci  -  TRUE? 
else  Vl JOINT) ? 

return? 

P  or  V  operation? 

intptdis  =  FALSE? 

do  operation  .. 
if  (intptrea)  M< IOINT) ? 
intptdis  =  intptrea  =  FALSE? 
return  ? 
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4.3.  Emulated  Commuciieatirc,^  tda.Edwa.ee 


The  eommuni cat ions  p  ^1  between  the  kernel  software 
and  the  communications  cor*  "oiler  was  discussed  in  detail  in 
chaster  2.  .Hew  is  the  teraelio.'i  between  the  slot  device 
drivers  and  the  channel  emulsteo  so  that  messages  can  be 
sent  though  the  UNIX  pipes? 


she  emulation  of  sr=  output  slot  is  relatively  straight¬ 
forward.  Each  time  a  device  driver  puts  a  new  command  in 
the  SLOT  COMMAND  BYTE ?  it  calls  the  OUTCOMM  subroutine. 
OUTCOMM  maintains  a  set  of  message  assembly  areas?  one  for 
each  slot.  when  nailed;  its  functions  are  simple? 


if  command  is  READY  FOR  TRANSFER? 

take  the  bytes  indicated  by  the  ADDRESS 
and  COUNT  fields  and  add  them  to  the 
end  of  the  message 

increment  the  ADDRESS  field  by  the  COUNT 
and  set  the  COUNT  to  zero 
place  END  OF  BUFFER  in  the  STATUS  field 
and  return 

if  command  .s  TERMINATE? 

put  th  MESSAGE  SEPARATOR  byte  at  the  end 
of  the  message 

write  the  message  to  the  appropriate  pipe 
clear  the  message  area 

place  END  OF  MESSAGE  in  the  STATUS  field 
and  return 


The  INTERRUPT  ADDRESS  field  in  tne  slot  control  buffer 
is  not  used  for  the  slot  output  emulation.  The  subroutine 
return  address  performs  that  function. 

The  emulation  of  the  input  slot  controller  cannot  be 
done  with  3  simple  subroutine  call.  The  output  slot  is 
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driven  by  events  occurring  inside  the  kernel  software 


The 


input  slot  *  on  the  other  hand?  is  driven  primarily  by  exter¬ 
nal  events?  namely  the  presence  of  bytes  and  messages  on  the 
input  line. 

The  emulation  of  the  input  slots  is  performed  by  the 
subprocess  INF'DISP?  which  has  already  been  mentioned  in  sec¬ 
tion  A  of  this  chapter.  INPDISP  is  implemented  as  a  subero- 
oess  which  is  awakened  whenever  there  is  3  message  available 
on  the  input  pipe.  After  reading  the  message  from  the  pipe? 
INPDISP  then  emulates  the  input  slot  controller  to  transfer 
the  message  into  the  kernel. 

Using  the  ADDRESS  and  COUNT  fields  of  the  selected 
slot?  INPDISP  transfers  bytes  from  the  message  into  the 
appropriate  ares  in  the  node.  When  3n  event?  such  as  BUFFER 
FILLED  or  END  OF  MESSAGE?  is  reached?  INPDISP  awakens  the 
appropriate  slot  driver  so  that  it  can  respond  to  the  event. 
The  slot  drivers  are  run  at  a  higher  priority  than  INPDISP? 
so  when  awakened  they  execute  immediately.  c 

When  INPDISP  gets  control  back?  it  means  that  the  input 
driver  has  finished  processing  the  last  event.  INPDISP  then 
continues  to  transfer  more  data  or  waits  for  another  mes¬ 
sage.  Hence  the  logic  for  INPDISP  ist 
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do  forever 

Wait  for  next  message  v 

Reed  in  message 

While  more  message  remains 

Use  ADDRESS  and  COUNT  fields  to  transfer 
bytes  from  the  message 
Adjust  ADDRESS  and  COUNT  fields 
If  more  message  remains* 

Set  STATUS  =  END  OF  BUFFER 
Awaken  slot  driver 
Set  STATUS  =  END  OF  MESSAGE 
Awaken  slot  driver 

This  results  in  a  back-and-f orth  action  between  the  kernel 
slot  driver  and  the  INF’DISP  process.  INPDISP  generates  an 
event  and  awakens  the  driver.  The  driver  reacts  to  the 
event  and  then  goes  back  to  sleep*  allowing  INPDISP  to  con¬ 
tinue.  It  is  important  that  the  driver  subprouess  have 


higher  priority  than  INPDISP.  If  INPDISP  had  the  higher 
priority ?  it  would  continue  executing  until  it  waited  for 


the  next  message. 


Here  again  the  INTERRUPT  ADDRESS  field  of  the  slot  con¬ 
trol  area  is  not  used  explicitly.  That  function  is  handled 


by  the  subprocess  package?  which  knows  at  wb3t  address  to 
continue  executing  the  driver  when  it  wakes  up.  Instead? 
the  space  of  the  INTERRUPT  ADDRESS  field  is  used  to  store 


the  address  of  the  private  semaphore  which  is  used  to  wake 
up  the  driver. 
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5 .  CONCLUSIONS 

5.1.  Evaluation  of  the  L'emal 

To  demonst rate  the  kernel  emulation  progrem?  3  simple 
use"  program  was  coded  and  sKecut.ed .  The  program  consisted 
of  three  user  processes J 


( 


uot 


Print  'Ucode  0  executing 
Create  3  process  on  node 
Start  the  r "ocess  Just  c 
Wait  for  3  message. 

Print  that  a  message  was 
display  the  message. 


on  node  X «  * 

1  to  execute  USER 
rested , 

received  and 


1 , 


USER  1  : 

Print  * Ucode  1  executing  on  node  X*‘ 

Create  a  process  on  node  2  to  execute  USER  2. 
Start  the  process  Just  crested. 

Print  ’Ucode  1  about  to  send  msg.* 

Send  message  ‘It  works!*  to  parent  process. 
Exit. 

USER  2: 

Print  'Ucode  2  executing  on  node  X.* 

Exit. 


The  'Kernel  calls'  made  to  start  processes  and  send  messages 
are  detailed  in  section  3.3. 


When  the  emulator  is  started?  it  sets  up  the  tree  and 
starts  the  kernel  in  each  node?  then  waits  for  a  command 
from  the  terminal.  The  terminal  operator  must  enter  com¬ 
mands  to  create  and  start  the  f 1  -st  user  process.  Detailed 
instructions  are  included  in  appendix  D. 


1 

sf 

Tins  S5 ssion  tor 

running  the 

simple  program 

above  is 

jj 

reproduced  below* 

Operator 

commands  are 

shown  in 
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parentheses  (the  parentheses  are  not  present  in  the  actual 
session ) * 


SESSION 


COMMENTS 


/.  xtree 


invoke  the  emulator 

command i < create  3  5  0> 

create  process  to  run  code 
segment  0  on  node  3  with 
priority  5 

Process  created  on  node  3 *  named  3  1  *** 
command i (start  3  1) 

start  the  process  oust  created 
Process  3  1  started  on  node  3 
Ucode  0  executing  on  node  3. 

Process  created  on  node  1?  named  1  1 
*% ¥  Process  1  1  started  on  node  i 
Ucode  1  executing  or:  node  1* 

Process  created  on  node  7?  named  2  1  **¥ 

% ¥¥  Process  2  1  started  on  node  2 
Uecde  1  about  to  send  msg. 

Mss'  received  by  Ucode  0  was: 

It  works ! 

Process  3  *  terminated*  node  3  'Mtt 
Process  1  I  terminated*  nods  1 
Ucode  2  executing  on  node  2. 

Process  2  1  terminated*  node  2 

The  messages  bracteted  by  '#*#'  are  informational  messages 


provided  by  the  kernel 


the  user  program. 


.*  he  oths 


messages  were  printed  by 


.Note  that  USES  2  didn't-  actually  execute  until  after 
USER  0  and  USER  1  had  finished.  Since  UNIX  actually  runs 
only  one  process  at  a  time?  there  is  no  real  parallelism* 
only  one  node  at  a  time  can  run.  Further*  close  examination 
of  the  user  program  reveals  that  USER  1  does  not  weit  for 
USER  2  to  finish  execution*  but  rather  to  Just  start  run¬ 


ning.  Hence?  the  seouenee  above  is  a  correct  one. 
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In  short?  this  demonstration  shows  that  the  kernel 
design  is  2 os i cal 1 y  correct  end  that  the  emulation  tech¬ 
niques  described  herein  actually  work*  However*  the  kernel 
functions  in  this  version  are  extremely  limited  and  much 
more  work  remains  to  be  done  before  it  reaches  its  final 
form  * 


The 


aMiOI  S 


success  of  ihs  kernel  design  is  its  fiexi- 
relatively  simple  and  easy  to  modify?  con- 


sistins  essentially  of  a  set  of  work  stations 
device  drivers)  which  retrieve  and  roce 
( p C B a  and  lOBs ) -  New  features?  such  as  those 
section  5.4?  can  be  implemented  in  some 
addins  new  options  in  existing  processes  or 
special  purpose  processes. 


(processes  and 
ss  work  packets 
discussed  in 
cases  be  simply 
by  adding  new 


Ha; 


O  \J  7 


vy 


*tD 


i .  2 .  Ihe  fiiiemsied  Use  of  MODULA 


In  Aug  78?  Harold  Roberts  completed  sn  evaluation  of 
the  MODULA  language  C7?iij-  He  concluded  that  MODULA  was 
viable  language  for  systems  programming  in  the  X-tree  pro¬ 
ject.  As  a  result  of  his  study ?  it  was  tentatively  decided 
that  MODULA  was  to  be  the  primary  programming  language  for 
X-tree . 


However?  some  problems  with  the  language  were  recog¬ 
nized.  Two  deficiencies  were  especially  important  to  the 
development  of  the  kernel. 


(1)  The  lack  of  pointers  forces  prodrams  to  be  written 
using  array  index  notation.  This  results  in  somewhat 
inefficient  code  and  occasionally  awkward  source  text. 


(2)  Althoush  MODULA  provides  for  the  explicit  expression  of 
multiple  processes  and  synch ronizati on ?  it  was  imple¬ 
mented  only  for  bare  ( no  underlying  operating  system) 
PDF  and  LSI-11 's. 


As  a  result  these  difficulties?  the  kernel  was  to  be 
designed  using  the  MODULA  language  and  then  translated  to  G 
for  testing  3nd  emulation.  As  work  progressed?  however? 


further  o  racisms 


're  discovered  which  led  eventually  to 


ibandoning  MODULA  altosether  for  the  kernel. 


The  first  problem  surfaced  during  the  design  of  the 
interface  to  the  communications  controller.  Although  it  is 
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actually  a  single  device?  it  acts  like  a  multiple  number  of 
identical  devices  (slots) ♦  One  straightforward  way  to 
interface  to  such  a  device  is  to  have  a  different  driver  for 
each  slot*  This  leads  to  relatively  simple  and  understand¬ 
able  code  since  the  alSori '•■hms  do  not  have  to  worry  about 
handling  multiple  slots  simultaneously.  But  this  approach 
is  only  practical  if  the  device  drivers  can  share  common 


code  and  each  can  have  a  private  pointer  to  the  control  ares 
for  its  slot.  If  this  is  not  possible?  the  amount  of  memory 


space  needed  for  duplicated  code  for  sil  the  drivers  would 
be  prohibitive* 


Unfortunately?  this  is  impossible  in  MODULA.  The 
addresses  of  device  registers  in  device  drivers  must  be 
declared  as  constants  and  cannot  be  passed  as  parameters  to 


I  il 


the  driver  when  it  is  initiated. 


The  second  problem  arose  when  the  algorithms  for  inter¬ 
preting  kernel  messages  wore  written.  When  3  message  is 
received?  its  contents  are  unknown.  It  is  only  after  the 
first  word  (REQUEST  TYPE)  of  the  message  has  been  examined 
that  the  format  of  the  message  is  known.  However?  it  is  now 
impossible  to  bresk  the  message  down  into  its  component 
parts  because  MODULA  is  strongly  typed  and  there  is  no  such 


ihing  as  a  variant  record. 


is  even  impossible  to  write  a 


ncn-hODULA  routine  to  accomplish  this  Since  there  is  no  was 
to  invoke  such  a  routine. 


The  only  alternative  in  MODULA  is  to*  define  all  of  the 
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Future  l3sks 


The  kernel  3  n  its  present  form  provides  sn  extremely 
limited  set  of  services.  However?  the  design  end  coding  W3S 
done  with  some  more  sene re 1  services  in  mind.  Several  pos¬ 
sible-  improvements  ere  discussed  below. 

A  I'D  ASYNCHRONOUS  MESSAGES  --  In  the  present  kernel?  3 
user  process  which  attempts  to  send  or  receive  a  message  is 
automatic  si Is  put  to  sleep  until  the  transmission  is  com¬ 
pleted.  While  this  is  suitable  for  some  applications?  some 
service  is  needed  whereby  a  user  could  issue  a  read  and  then 
continue  processing  and  periodically  check  to  see  if  a  mes¬ 
sage  has  ween  received.  This  service  would  also  allow  the 
user  to  issue  multiple  read  reouests  simultaneously  (the  10 
number  in  the  message  header  and  I OB  was  included  in  antici¬ 
pation  of  this  service), 

ABB  A  NODE  INTERNAL  CLOCK  --  A  method  of  delaying  3  user 
process  for  a  specified  period  is  needed*  The  design  of  3 
clock  driver  was  completed  in  MODULA  but  was  not  incor¬ 
porated  into  the  present  emulator  (the  code  is  reproduced  in 
appendix  E).  The  emulation  of  a  node  clock  might  be  accom¬ 
plished  by  incrementing  a  counter  at  specified  events?  such 
as  P  and  V  operations?  and  at  completion  of  each  idle  loop, 

PERFORMANCE  MEASUREMENT  -  It  mas  be  possible  to  instru¬ 
ment  the  emulator  so  that  experiments  can  be  performed  to 
compare  various  possible  ways  of  splitting  complex  user 
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APPENDIX  A  -  Kernel  Data  Structure  Die 3 rams 


Global  Memory  Object  NAME  5 


;  uni cue  10 


node  number  of  leaf  at  which  object 
resides 

guaranteed  uniaue  within  the  node 


Global  Memory  ADDRESS! 

!  obuect  name  ■ 

« _ _ I 

!  offset  « — >  offset  from  beginning  of  object 


Process  Control  Block  (PCB)J 
!  link  field 
!  full  process  NAME 
!  priority 


j  s  l/c  tui  W « 


\  external  status  word 
’  semaphore  pointer 
!  I OB  list  pointer 
I  NAME  of  parent 
I  program  counter 
!  stack  pointer 


—  "-used  for  the  FREEPCB 
linked  list 

— "-identifies  the  process  slobally 


— Mr.ternsl  process  status >  such  3S 
condition  codes >  etc 
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FORK (2) 


fork  -  spawn  new  process 


SYNOPSIS- 

fork (  > 


%  § 


DESCRIPTION 

Eork  is  the  on,ly  way  new  processes  are  crested*-  The  new--—- 
process's  core  image  is  a  copy  of  thst  of  the  caller  of 
fork*  The  only  distinction  is  the  fact  that  the  value 
returned  in  the  old  (parent)  process  contains  the  process™ I-p- 
of  the  new  (child)  process*  while  the  value  returned  in  the 
mil  j  is  0.  Process  ID's  range  from  1  to  30*000*  This  pro¬ 
cess  ID  is  used  by  wait (2)* 


1  i. 


Files  open  before  the  fork  are  shared*  and  have  a  common 
read-write  pointer*  In  particular  *  this  is  the  way  that 
standard  input  and  output  files  are  passed  and  also  how 
pipes  are  set  up* 

SEE  ALSO 

wait (2) *  exec(2) 


S  € 

!  L 
1  c 


DIAGNOSTICS 

Returns  -1  and  fails  to  create  a  process  if*  there  is  i nadeS- 
auate  swap  SP3ce*  the  user  is  not  super-user  and  has  too- 
many  processes*  or  the  system's  process  table  is  full*  Only 
the  super-user  can  take  the  last  process-table  slot, 


ASSEMBLER  (PDP-11) 

(fork  *  2, ) 
sas  fork 

(new  process  return) 

(old  process  return*  new  process  ID  in  r0) 


The  return  locations  in  the  old  and  new  process  differ  by 
one  word.  The  C-bit  is  set  in  the  old  process. if  3  new  pro¬ 
cess  could  not  be  crested.  „  -  -  5 


1  l 


Printed  3/30/79 
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KILL(2) 


NAME 


kill  ~  send  signal  to  s  process 


SYNOPSIS 

kill  (pid*  sia)  » 

DESCRIPTION 

Kill  sends  the  signal  sid  to  the  process  specified  by  the 
process  number  in  rO.  See  sidDal(2)  for  a  list  of  signals* 

The  sending  and  receiving  processes  must  have  the  same 
effective  user  ID?  otherwise  this  call  is  restricted  to  the 
supei — user. 

If  the  process  number  is  0*  the  signal  is  sent  to  all  other 
processes  m  the  sender's  process  group*  see  tty (4)* 

.  _  -  -  -w--  -  - 

If  the  process  number  is  -1?  and  the  user  is  the  super-user? 
the  signal  is  broadcast  universally  except  to  processes  6 
and  1*  the  scheduler  and  initialisation  processes-*  see' 
init (8) . 

Proccesses  may  send  signals  to  themselves*  ~ 


SEE  ALSO 

signal (2) *  kill(l) 


DIAGNOSTICS 

Zero  is  returned  if  the  process  is  killed*  -1  is  returned  if  ~* 
the  process  doss  not  have  the  same  effective  user  ID  and  the 

user  is  not  super-user*  or  if  the  process  does  not  exist*--  . 

-  _*  _ 

ASSEMBLER  (PDP-11) 

<  ki 1 1  =  37 . )  -  -  - 

(process  number  in  rO)  .  - 

sys  kill*  sid  ' 


Printed  3/30/-7? 
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PIPE (2) 


NAME 

pipe  -  create  sn  interprocess  channel 


SYNOPSIS 

pipe( f i Ides ) 
ir.t  fildssC?!!: 


DESCRIPTION 

The  Else  system  call  creates  sn  T/0  mechanism  cslled  a  pipe. 
The  file  descriptors  returned  can  be  used  in  read  snd  write 
operations.  When  the  pipe  is  written  using  the  descriptor  ’ 
fildesClO  up  to  4096  bytes  of  data  are  buffered  before  the 
writing  process  is  suspended.  A  read  using  the  descriptor* 
fildesCOl  will  pick  up  the  data. 

It  ?„s  assumed  that  after  the  pipe  has  been  set  up?  two  (or 
more)  cooperating  processes  (created  by  subsequent  fork 
calls)  will  pass  data  through  t>»e  pipe  with  read  and  write 
ca 1 1 s « 

The  Shell  has  a  syntax  to  set  up  a  linear  array  of  processes 
connected  by  pipes. 


Read  calls  on  sn  empty  pipe  (no  buffered  data)  with  only  one 
end  '.all  write  file  descriptors  closed)  returns  an  end-df^ 
file. 


SEE  ALSO 

sh(D?  read(2)?  write(2).»  fork(2> 


DIAGNOSTICS 

The  function  value  zero  is  returned  if  the  pipe  was  created? 
-1  if  too  many  files  3re  already  open.  A  signal  is  gen¬ 
erated  if  a  write  on  a  pipe  with  only  ona  end  is  attempted. 


BUGS 

Should  more  than  4096  bytes  be  necessary  in  any  pipe  among  a 
loop  of  processes?  deadlock  will  occur.  --  - 

ASSEMBLER  (PDP-11)  - 

(Pipe  =  42 .  ) 

sys  pipe  ■ 

(read  file  descriptor  in  r0)  *•/.. 

(write  file  descriptor  in  rl '  .  ..  — . — 


Printed  3/30/79 
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SIGNArL  (2) 


NAME 

signal  -  catch  or  ignore  signals 


SYNOPSIS 

♦include  <signsl .  h> 


(^signal  isigf  func>)()? 
( *f  unc  >  ( )  ? 


DESCRIPTION 


A  signal  is  generated  by  some  abnormal  event*  initiated 
either  by  user  at  a  terminal  (Quit?  interrupt ) ?  by  a  program 
error  (bus  error?  etc.)?  or  by  reauest  of  another  program 
(kill).  Normally  all  signals  cause  termination  of  the 
receiving  process?  but  a  signal  call  allows  them  either  to 
be  ignored  or  to  cause  an  interrupt  to  a  specified  location. 
Here  is  the  list  of  signals  with  names  as  in  the  -include  - 
file. 


5IGHUP 

1 

SIGINT 

o 

SIGQUIT 

SIGILL 

4* 

3IGTRAP 

5* 

SIGIOT 

6t 

SIGEMT 

7* 

SIGFPE 

8* 

SIGKILL 

9 

SIGBUS 

10* 

SIGSEGU 

11* 

SIGSYS 

12* 

SIGPIPE 

13 

SIGALRM 

14 

SIGTERM 

15 

16 

The  sta 

rred 

hangup 

interrupt 

Quit 

illegal  instruction  (not  reset  when  caught) 
trace  trap  (not  reset  when  caught) 

IOT  instruction 

EMT  instruction 

floating  point  exception 

kill  (cannot  be  caught  or  ignored) 

bus  error 

segmentation  violation 

bad  argument  to  system  call 

write  on  a  pipe  with  no  one  to  read  it 

alarm  clock 

software  termination  signal 
unassigned 


in  the  list  above  cause  a  core  image  if 


not  caught  or  ignored. 


If  func  is  SIG_DFL?  the  default  action  for  signal'  sig  is 
reinstated?  this  default  is  termination?  sometimes  with  a  ' 
core  imsse.  If  fuoc  is  5IG-IGN  the  signal  is  ignored. ^ 
erwise  when  the  signal  occurs  func  will  be  called  with  the;  - 
signal  number  as  argument.  A  return  from 'the  function  will^ 
continue  the  process  at  the  point  it  was  interrupted1*..  — ~- 
Exeept  as  indicated?  a  signal  is  reset  to  SIGi.DFL  after  " 
being  caught.  Thus  if  it  is  desired  to  catch  evervr-such 
signal?  the  catching  routine  must  issue  another  siljpal  eallH; 

When  a  caught  signal  occurs  during  certain  System  calls?  the 
call  terminates  prematurely.  In  particular  this  can  occur” 
during  a  read  or  uriie(2)  on  a  slow  device  (like  a-  terminal 


;  1 


Printed  3/30/7? 
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SIGNAL  ( 2 ) 


but  not  a  file)?  and  during  cause  or  uiai±(2>*  When  such  a 
signal  occursi  the  saved  user  status  is  arranged  in  such  a 
way  that  when  return  from  the  signal-catching  takes  place? 
it  will  appear  that  the  system  call  returned  an  error 
status*  The  user's  program  may  then?  if  it  wishes?  re- 
execute  the  call* 

The  value  of  signal  is  the  previous  (or  initial)  value  of 
func  for  the  particular  signal* 

After  a  £o.ck(2)  the  child  inherits  all  signals*  Exec(2) 
resets  all  caught  signals  to  default  action* 

SEE  ALSO 

kill(l)?  kill(2)?  Ptrace(2>?  setJmp<3> 

DIAGNOSTICS 

The  value  (int)-l  is  returned  if  the  given  signal  is  out  of 
range* 

BUGS 

If  a  repeated  signal  arrives  before  the  last  one  can  be 
reset?  there  is  no  chance  to  catch  it* 

The  type  specif ication  of  the  routine  and  its  func  .argument, 
are  problematical .  - 

j  VAX-11?  odd  values  for  func  are  the  same  as  SIG-^IGN* 

ASSEMBLtR  (PDP-11) 

(signal  =  48* )  * 

sys  signal?  sis?  label 
(old  label  in  rO) 

If  label  is  0?  default  action  is  reinstated.  If  label  is 
odd?  the  signal  is  ignored.  Any  other  even  label  specifies 
an  address  in  the  process  where  an  interrupt  is  simulated*- 
An  RTI  or  RTT  instruction  will  return  from  the  interrupt.  - 

NOTES  < VAX-11) 

The  following  defines  the  mapping  of  hardware  traps  -to  sig-rc 

ft  3 1 S  *  .  „v  . 


Arithemetic  traps! 

Integer  overflow  SIGFPE 
Integer  division  by  zero  SIGFPE 
Floating  overflow  SIGFPE 
Floating  underflow  SIGFPE 
Floating  division  by  zero  SIGFPE 
Decimal  division  by  zero  <  SIGFPE 
Decimal  overflow  SIGFPE 
SubscriPY.-rsnge  SIGFPE 


Printed  3/30/79 
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Access  control  (i«e*  protection 
violation) 


except  length  violation 
Translation  not  valid*  and 

SIGBUS 

Length  access  control 

SIGSEGU 

Reserved  instruction 

SIGILL 

Customei — reserved  instr* 

SIGEMT 

Reserved  operand 

SIGILL 

Reserved  sddressins 

SIGILL 

Trace  pending 

SIGTRAP 

Bpt  instruction 

SIGTRAP 

Compatibility-mode 

SIGEMT 

Chme 

3IGSEGV 

Chms 

SIGSEGO 

Chmu 

SIGBUS 

Printed  3/30/79' 
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APPENDIX  D  -  Instructions  for  Running  the  Emulator 


The  emulator  code*  in  both  source  and  executable  forms? 
is  in  the  directory  '/b/xtree/nei 1/kersim' .  The  instruc¬ 
tions  in  this  appendix  assume  that  this  is  the  current  work¬ 
ing  directory. 

1.  Sunoiod  the  Emulsion 

The  terminal  operator  must  start  the  emulator  by  enter — 
ins'  the  command 

£  xtree 

The  system  will  start  up  the  kernel  in  each  node  (UNIX  pro¬ 
cess)  and  prompt  when  its  ready  for  commands? 


command  1 


c 


To  start  the  execution  of  the  first  user  process?  two 
commands  must  be  entered  from  the  terminal? 


i.  To  create  a  process  to  run  the  code?  enter 
create  node  prio  pdm 

where  node  =  node  nbr  on  which  to  create  the  process 
prio  =  the  new  process's  priority 
pdm  =  the  user  code  nbr  of  the  desired  prodram 

The  system  will  print  a  process  name  to  be  used  in  the 
start  command. 


io  start  the  execution  of  the  process?  enter 


start  name 

where  n  ns  =  the  2-pa rt  process  name  displayed  when 
the  process  was  crest  d  = 


For  example?  the  simple  prodram  currently  in  the  emula¬ 
tor-  (section  5.1)  can  be  started  by  executind  user  code  0-on- 
node  3  with  priority  5? 

commanrficreate  350  -  ~— 

Process  created  on  node  3?  named  3  1 
command? start  3  1 


2.  Codiod  and  Liokios  Xour  Quo  User;  E’aodrams 

User  prodrams  are  written  in  C  and  are  run  as  subrou¬ 
tines  of  a  kernel  process  (see  section  3*3).  User  prodrams 
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s re  restricted  in  the  following  ways.  -  =; 

1*  No  input  operations  are  allowed*  Any  dsts  needed  miist 
be  received  via  messages. 


All  variables  3nd  dsts  must  be  local.  No  externals  are 
all owed . 

Printing  statements  may  be  included#  but  should  be 
framed  by  the  macros 


DISABLE  —  ■ 

erintf  (..»>?* 

ENABLE 

to  prevent  a  process  switch  when  the  stack  is  too  large 
for  the  dsts  stack  save  areas  (section  4*2). 

4,  Each  user  program  receives  ar?  integer  argument.  This" 
argument  must  be  used  as  the  first  argument  to  all  ker¬ 
nel  calls  from  the  user  program. 

The  kernel  calls  discussed  in  section  3.3  mas  be  used* 
Specific  examples  in  the  C  language  can  be  found  in  appendix 
B*  file  'sAucodes.c' .  „  '  ’ 

UNIX  does  not  support  dynamic  subroutine  linkage*  Con-* 
seouently#  any  user  programs  must  be  statically  linked  wi^h 
the  emulator  before  the  emulator  is  executed.  Withirr  the 
emulator#  each  user  program  is  referred  to  by  a  'user  code 
number7  (see  the  third  argument  of  the  'create'  command)-.  /' . 

The  inclusion  of  new  user  programs  takes  three  steps*' 

1.  Put  the  source  for  the  user  programs  into  the  file 

'sAucodes.c'.  -  - ~  - 

2.  .  Update  the  user  initialization  code  (file  's6pmsn%.6?»  ~- 

routine  'initusr')  to  assign  -the  program  nsnas  to tfii" 

•  _  V  ; 

table  of  routine  pointers  (array  'ucodeCU' >v  The  indak 
to  which  a  program  name  is  assigned  is  the  user  code- 
number.  -  ~  -  - -r  -  - 

-  -=  '  'l” 

_  - 

3.  Execute  the  UNIX  command  'make'.  ThisjLwill  do  ell  the 

necessary  compilation  and  linking  a ut omatic3lTyi — - 
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APPENDIX  F 


Hater i 3 


on  Future  Work 


:E33  HODEL 


i  n 


5  sent 


the  full  X-tree  swstea  with  paging f  how  is  3  user 
in  the  X— tree  to  be  represented  and  aeni puls ted?  I n 
represented  by  two  blocks?  the  Process 


P "OC0S3 


fPCB)  and 


f.his 


Process  Work  Object 


urn 


he 

-4  s 


le  r^b 
5i&S:0  Pi 

i  ta i ns  i 

r'OOSSii 

h.aek  Pt 


is  relative 

of  the  n 

le  in* Qritis w 
Xn  part 
Inter  (DSP) 


l»  saall  and  is  aaintained  in  the 
ids  in  which  the  process  is  running* 
Lon  needed  by  the  kernel  to  control 
Leuiar*  the  Frosra®  Counter  < FO  and 


s  re  K.6? 


ne  re  < 


The  r WO  rssidss  in  the  ^lobsl 


i 


(o  I 


ihs 


becking 


svore  if  you  prefer  to  call  it  that)  and  is  p-ased  intfi  thfr 
node  as  required-  It  contains  all  of  the  working  dst.3 
storage  for  the  process  and  is  consolidated  into  a  single 
object  which  can  be  viewed  as  having  two  parts*  The  low 
address  part  of  the  PWO  is  fixed  in  sise  and  holds  the  pro¬ 
cess  date  which  is  not  needed  in  the  PCS-  For  example?  the 
register  contents  for  the  process  would  be  kept  here* 


The  second  part  of  the  PUO  is  the  data  -stsck  for  the 
process.  It  extends  fr-oa  the  top  of  part  1  in  an  open  ended 
fashion.  This  siac)  is  used  in  the  conventional  aanner  for 
data  storage  and  for  block  entry  and  exit*  -  - 

PC3  (in  node) 


suo  (on  disk) 


top  of  stack 


registers? 

etc* 


Note  especially  that  there  is  no  notion 


ie  which  'belongs4 


hhe 


■rocess « 


of  a  block  of 
are  two  reasons 


for  this ! 

1*  All  X-tree  coda  is  pure  code  end  is  not  modifiable.  So 
there  is  no  need  for  a  separate  cops  for  each  user. 

2.  Linking  of  object  modules  will  be  done  dynamically  *at 
run  time  as  code  modules  are  called. 

As  a  result?  all  code  is  located  and  paged  in  to  the  node 
dynamically  as  needed  and  there  is  no  requirement  to  have 
separate  user  copies  of  any  code. 

Concomitant  with  the  lack  of  a  separate  'code  image' 
for  a  process  is  the  fact  that  the  PC  and  DSP  are  not  Just 
simple  local  addresses?  but  are  full  global  addresses  com¬ 
plete  with  global  object  names  and  offsets.  As  a  result? 
the  PC  identifies  the  object  which  contains  the  current  code 
for  the  process.  Similarly?  the  DSP  serves  the  dual  func¬ 
tion  of  stack  pointer  and  pointer  to  the  PUO  (this  is  the 
DSP  with  the  offset  masked  to  zero)* 

Since  the  PUO  for  a  process  never  changes?  the  'object 
name'  portion  of  the  DSP  is  constant  for  the  life  of  the 
process.  The  PC?  on  the  other  hand?  may  change  with  some 
frequency  as  code  in  new  objects  is  invoked.  In  this  csser 
the  old  PC  (including  object  name)  must  be  stored  in  the 
data  stack  to  be  restored  upon  return. 

Conceptually  then?  the  page  table  in  the  node  may  be 
viewed  as  a  table  of  translations  from  global  addressesto 
local  addresses! 


OBJECT  NAME 


Node  * 


Id  * 


0PFSE1 


Of  course?  this  table  is  implemented  using  hashing  in  order 
to  reduce  the  search  time.  Ac  itionally-  a  translation 
look-aside  buffer  is  used  to  eliminate  the  search  for  most, 
accesses  in  a  C3che-like  manner.  •  ■  -  ■ 

If  the  node  numbers  in  the  system  were  fixed  length? 
then  this  would  suffice.  However?  the  node  numbers’ in  the.: 
final  X-tree  may  be  unbounded  in  length  so  a  different' 
scheme  is  necessary  and  is  outlined  below.  .  - 


In  this  new  scheme?  all  the  active  object  names  for  a 
process  ere  centralized  .nto  one  place  (currently  called  the 
C-list  for  lack  of  a  better  name).  This  could  be  included 
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33  part  of  the  PWO  or  ss  an  extension  to  the  PCB*  The  vary¬ 
ing  length  node  numbers  could  then  be  managed  as  s  heap 
(either  one  per  process  or  3S  a  centre!  hesp  for  each  node).* 
The  key  here  is  that  esch  object  may  now  be  be  referred  "to 
by  a  fixed  length  index  into  the  C-list  and  the  process  can 
be  uniquely  identified  by  it's  index  into  the  table  of  PCBs  * 
laken  together  the  PCB  and  C-list  indices  are  a  fixed  length 
field  which  unieuely  identifies  an  object*  The  TLB  then 
looks  lik-et 


Local  Page  ♦ 

Proc  ID 

C-list  ind 

Page  # 

Of  course?  if  the  page  reference  is  not  found  in  the 
TLB*  the  software  riiust  first,  follow  the  chains  outlined 
above  to  determine  the  object  requested*  then  look  up  the 
local  page  number  in  the  same  manner  as  before*  The  local . 
address  is  then  given  to  the  TLB.  .  -  - * 


R 


R 


K 
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!.  a  MODULE  Clack/Delay  Dzivex- 


*'  'te'k ’k'fc'kif'k'k 'k'k  'k'k'k'k'kmk'k'k'k'k"k‘k'k'k'k'k'k'k'k'kmk'k'k'k'k  'ktkk‘i£'k'k'k*k'kik'k'k‘k'k,k'k*k'k*k’k*)km!k'!k*k 

x  ■  1%  *T>  T»  IJ*  *T>  .TV  "IN  »V*  V*  ^  >“  <T*  *p  *T^  «p  «p  rp  ip  «p  <p  ^  ^  rjk  ^  ip  *▼»  ip  *p  /p  ip  ip  ip  ip  ip  «p  ^  ip  ip  ip  ip  "P  ip  ip  ip  ip  ip  «p  P  ip  ip  p  ip  *P  P 

CLOCKDRIVER  device  module?  handles  clock  interrupts 
and  defines  the  means  for  delaying  a  user  process 
for  a  specific  length  of  time* 

i  ^  'i'  ‘-t  ^  ^  ^  ^  'i*  ^ ^  ^  ^  ^  ^  ^  i  ^  *■4'  ^  ^  ^  ^  ^  ^  ^  di  <4  ^  ‘X-  ‘X-  ‘4f  ^  ^  4*  'if  ^  ^  \ 

■P  ip  «p  "V*  ir  ^  -  [  ip  if.  ip  Ip  •'pi  ■  ,  ip  ip  l|>  ^  ^  ^  ip  «P  .p  ^  ip  iP  ^b  ip  ^  ip  ip  ^  ifb  ip  ip  ip  ^  ip  ip  ifb  iP  ip  ip  iP  ip  iP  ip  ip  ip  ip  ^  *  iP  iP  iP  ip  *P  f 


device  module  CLOCKDRIVER  C CLOCK-PRIORITY 3 i 
define  del  as ? 
use  NEtRRROCS *  awaken? 

pcbptr?  pcbtype?  pcb*  . -  - 

const  WQSIZE  *  NBRPRQCS+1? 

\  <P  i^  *p  »r  *T*  M?  ^  *1  *  *p  *p  ip  ip  «xb  iP  ^b  lP  *P  Ip  ^b  ^  q*  ip  ^  «p  «p  ip  ip  if.  ip  ^  ip  ip  ip  ip  ip  ip  ^b  iyw  ^  i^i  ip  >p  ^  qb  ip  ^b  iP  ^b  ^b 

Defines  a  Queue  of  processes  (managed  i  n  a- 
■  circular  fashion)  sorted  into  the  order  in 
which  they  are  to  be  activated*  Each  entry 
contains  a  PCS  pointer  and  the  proper  time-  to* 
delay  it  after  the  previous  process  has  been 
activated . 

'it ’k  -k  k  k  -k 

if*  ip  ■  *fb  ip  /P  .p  ip  ,p  .p  -p  iP  ^  p  P.  <p  ip  i**  *p  Jfi  ^  flb  ip  ^  ip  «T*  <p  ^  iP  if>  if1  *  ip  *P  iP  ip  ^T1  <P  <P  qb  <P  <P  T  *  ip  ip  *p  ip  ijv  g 

var  e ♦  array  OJWQSIZE-i  of  record 


nteser  ? 

end? 

f?  b?  integer? 
vti  integer? 


■{front  3nd  back  pointers}- 
■C length  of  oueuel 
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procedure  delsa  (proctpcbptr ?  'r<*  integer )  ? 
vsr  fi  r>  tJ  integer* 
begin 

if  n>0  then 

p  ;=  f •  r  i =  b 5  t  :=  0? 
while  (K>b'  end  ( t+nCp3  * dt<n>  do 
inc( t »  gCp3 , dt)  ? 

P  ;«  Cp+1)  nod  WQSIZE? 

end? 

while  pOp  do 

gC  r  3  t=  aC(r-I)  mod  WQSIZE3 ? 
r  !  =  ( r-1 )  mod  WQSIZE?  - 

end  ! 

aCp3.dt  !  =  n-t? 

GCpJiPcb  !  =  proc? 

PcbCproc] . psw  ;=  pcbCprocD > psw  or  Cl*33? 
dec(GCCp+l>  mod  WQSIZE3*dt»  Gtp3*dt)? 
b  i=  <b+l)  mod  WQSIZE? 
inc  (ct)  ? 


else 


s woken  (proc)! 


end  ? 

eno  tielse? 


process  clock  C CLOCK- INTERRUPT- ADDRESS 3 ? 

vs r  csr  C CLOCK -STATUS-REGISTER- ADDRESS 3 I  bits? 


vsr 

besin 

loop 


csr  163  t=  true! 

doio  5 

loop 


end  t 


eno  * 

end  clock? 


begin 

f  1=  0? 
b  ;  =  05 
ct  1=  0! 
clock!' 

end  CLOCKDRIVER? 


when  ct=0  exit? 
dec(oCf3 .  dt) * 
when  oCf3*dt>0  exit? 
3wsken  (oCf3«PCb)f 
f  J=  (f+l)  mod  WQSIZE? 
dec  (ct) ? 
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